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ABSTRACT  ** 

We  consider  a  distributed  team  decision  problem  in  which  different  agents  obtain  from  the  environ¬ 
ment  different  stochastic  measurements,  possibly  at  different  random  times,  related  to  the  same 
uncertain  random  vector.  Each  agent  has  the  same  objective  function  and  prior  probability  distribu¬ 
tion.  We  assume  that  each  agent  can  compute  an  optimal  tentative  decision  based  upon  his  own 
observation  and  that  these  tentative  decisions  are  communicated  and  received,  possibly  at  random 
times,  by  a  subset  of  other  agents.  Conditions  for  asymptotic  convergence  of  each  agent’s  decision 
sequence  and  asymptotic  agreement  of  all  agents’  decisions  are  derived. 


^  *  Research  was  supported  by  the  Office  of  Naval  Research  under  grant  ONR/NOOO 14-77-00532 
CL,  (NR  041-519) 

CO  **Room  35-406,  Laboratory  for  Information  and  Decision  Systems,  Massachusetts  Institute  of 
Li  i  Technology,  Cambridge,  Mass.,  02139. 

This  paper  was  submitted  to  the  1982  IEEE  Conference  on  Decision  and  Control  and  to  the  IEEE 
Transactions  on  Automatic  Control. 


piis  document  has  been  approved 
lor  public  release  and  sale;  its 

dhrtV{ion  js  unlimited. 


010 


NWBNS^W- 


<  ’'  J  £• 


Consider  the  following  situation:  A  set  {1  of  N  agents  possessing  a  common  model  of 

the  world  (same  prior  probabilities)  and  having  the  same  cost  function  want  to  make  a  decision.  Each 
agent  bases  his  decision  on  a  set  of  observations  he  has  obtained  and  we  allow  these  observations  to 
be  different  for  each  agent.  Given  this  setting,  the  decisions  of  the  agents  will  be  generally  unrelated. 
Aumann  [1]  has  shown,  however,  that  agreement  is  guaranteed  in  the  following  particular  case: 

If  the  decision  to  be  made  is  the  evaluation  of  the  posterior  probability  of  some  event  and  if  all 
agents'  posteriors  are  common  knowledge,  then  all  agents  agree.  (In  Aumann’s  terminology,  common 
knowledge  of  an  event  means  that  all  agents  know  it,  all  agents  know  that  all  agents  know  it,  and 
so  on,  ad  infinitum.  ) 

The  situation  where  each  agent’s  posterior  is  common  knowledge  is  very  unlikely,  in  general. 

On  the  other  hand,  if  agreement  is  to  be  guaranteed,  posteriors  have  to  be  common  knowledge.  The 
problem  then  becomes  how  to  reach  a  state  of  agreement  where  decisions  are  common  knowledge, 
starting  from  an  initial  state  of  disagreement. 

Geanakoplos  and  Polemarchakis  [6]  and  Borkar  and  Varaiya  [5]  gave  the  following  natural 
solution  to  the  above  problem:  Namely,  agents  start  communicating  to  each  other  their  tentative 
posteriors  (or,  in  the  formulation  of  (5]  the  conditional  expectation  of  a  fixed  random  variable)  and 
then  update  their  own  posterior,  taking  into  account  the  new  information  they  have  received.  In 
the  limit,  each  person’s  posterior  converges  (by  the  martingale  convergence  theorem)  and  assuming 
that  ’‘enough"  communications  have  taken  place,  they  all  have  to  converge  to  a  common  limit. 

The  above  results  hold  even  when  each  agent  obtains  additional  raw  observations  during  the 
adjustment  process  and  when  the  history  of  communications  is  itself  random.  Similar  results  were 
also  proved  for  a  detection  problem  [5]. 

A  related  -  and  much  more  general  situation  -  is  the  subject  of  this  paper;  we  assume  that  the 

agents  are  not  just  interested  in  obtaining  an  optimal  estimate  or  a  likelihood  ratio,  but  their  objective 

is  to  actually  minimize  an  arbitrary  cost  function.  In  this  setting,  it  is  reasonable  to  assume  that 

agents  communicate  to  each  other  tentative  decisions.  That  is,  at  any  time,  an  agent  computes  an _ 

optimal  decision  given  the  information  he  possesses  and  communicates  it  to  other  agents.  Whenever 

an  agent  receives  such  a  message  from  another  agent,  his  information  essentially  increases  and  he  will,  £ 

in  general,  update  his  own  tentative  decision,  and  so  on.  In  the  sequel  we  prove  that  the  qualitative  " 

results  obtained  in  (5),  (6]  for  the  estimation  problem  (convergence  and  asymptotic  agreement)  are _ _ 

also  valid  for  the  decision  making  problem  for  several,  quite  general,  choices  of  the  structure  of  the - y 
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cost  function.  However,  tentative  decisions  do  not  form  a  martingale  sequence  and  a  substantially 
different  mathematical  approach  is  required.  Wc  point  out  that  estimation  problems  are  a  special 
case  of  the  decision  problems  studied  in  this  paper,  being  equivalent  to  the  minimization  of  the  mean 
square  error. 

A  drawback  of  the  above  setting  is  that  each  agent  is  assumed  to  have  an  infinite  memory.  We 
have  implicitly  assumed  that  the  knowledge  of  an  agent  can  only  increase  with  time  and,  therefore, 
he  has  to  remember  the  entire  sequence  of  messages  he  has  received  in  the  past.  There  is  also 
the  implicit  assumption  that  if  an  agent  receives  additional  raw  data  from  the  environment,  while 
the  communication  process  is  going  on,  these  data  arc  remembered  forever.  These  assumptions 
are  undesirable,  especially  if  the  agents  are  supposed  to  be  humans,  because  limited  memory  is  a 
fundamental  component  of  the  bounded  rationality  behavior  of  human  decision  makers.  We  will 
therefore  relax  the  infinite  memory  assumption  and  allow  the  agents  to  forget  any  portion  of  their 
past  knowledge.  We  only  constrain  them  to  remember  their  most  recent  decision  and  the  most  recent 
message  (tentative  decision)  coming  from  another  agent.  We  then  obtain  results  similar  to  those 
obtained  for  the  unbounded  memory  model,  although  in  a  slightly  weaker  sense. 

A  particular  problem  of  interest  is  one  in  which  all  random  variables  arc  jointly  Gaussian  and  the 
cost  is  a  quadratic  function  of  an  unknown  state  of  the  world  and  the  decision.  It  was  demonstrated 
in  [5]  that  the  common  limit  to  which  decisions  converge  (for  the  estimation  problem)  is  actually 
the  centralized  estimate,  i.e.  the  estimate  that  would  be  obtained  if  all  agents  were  to  communicate 
their  detailed  observations.  We  prove  (section  4)  that  the  same  is  true  in  the  presence  of  memory 
limitations,  provided  that  each  agent  never  forgets  his  own  raw  observations.  (That  is,  he  may  only 
forget  past  tentative  decisions  sent  to  him  by  other  agents.) 

We  end  this  section  with  a  discussion  of  the  nature  of  our  model  and  the  possible  range  of  its 
validity.  The  first  point  to  be  made  is  that  there  are  strong  underlying  rationality  assumptions  if  we 
postulate  that  our  model  describes  the  way  that  humans  update  their  decisions,  even  if  we  constrain 
them  to  have  limited  memory.  The  decision  of  an  agent  at  some  stage  is  the  optimal  decision  given 
his  observations  and  some  tentative  decisions  of  other  agents  that  have  been  communicated  to  him. 
Except  for  particular  problems  such  as  linear  quadratic  gaussian  decision  problems,  this  corresponds 
to  a  very  hard  optimization  problem.  The  reason  is  that  the  other  agents'  tentative  decisions  rep¬ 
resent  an  indirect  way  of  communicating  information.  Essentially,  when  agent  A  receives  agent  B'a 
decision,  he  tries  to  deduce  B'a  observations  from  B’a  decision.  So,  one  agent  combines  information 
to  obtain  an  optimal  decision  and  the  other  agent  tries  to  invert  this  map  and  recover  the  original 
information.  While  this  is  mathematically  meaningful,  it  is,  in  general,  beyond  the  information 
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processing  capabilities  of  human  decision  makers.  In  view  of  the  above,  we  propose  the  following 
interpretation:  while  it  is  not  true  that  human  decision  makers  will  actually  do  the  computations 
prescribed  by  the  model,  a  group  of  well-trained  individuals,  very  familiar  with  the  decision  problem 
at  hand  will  make  tentative  decisions  dose  to  the  ones  predicted  by  the  model  and  will  agree  in  the 
'imit.  Bacharach  [2,  p.183]  gives  some  support  to  the  above  statement  by  citing  evidence  that  “in 
practice  awareness  of  others’  opinions  tends  to  modify  one’s  own:  in  the  casi  of  factual  questions  a 
general  tendency  to  converge  has  often  been  observed”. 

A  second  weak  point  of  the  model  is  that  not  only  each  agent  has  the  same  prior  information  and 
knows  the  statistics  of  the  other  agents’  observations  but  also  has  the  same  model  of  the  probabil¬ 
istic  mechanism  that  generates  inter-agent  communications.  In  particular,  if  this  is  a  deterministic 
mechanism,  an  agent  must  know  the  precise  history  of  communications  between  any  pair  of  other 
agents,  a  very  strong  requirement.  This  weakness  disappears,  however,  if  every  tentative  decision 
is  broadcasted  simultaneously  to  all  other  agents.  This  will  be  the  case,  for  example,  if  a  set  of 
experts  with  the  same  objective  telcconfer  and  take  turns  into  suggesting  what  they  believe  to  be 
the  optimal  decision. 

Finally,  we  point  out  that  our  scheme  is  fundamentally  different  from  schemes  for  distributed 
decision  making  and  computation  in  which  each  agent  controls  his  own  decision  variable  and  the 
cost  function  depends  jointly  on  all  agents’  control  variables  (3,  7],  What  we  have  here  is  a  common 
decision  variable  which  has  to  be  jointly  fixed  by  a  set  of  decision  makers. 


II.  MODEL  FORMULATION. 


In  this  section  we  present  a  mathematical  formulation  of  the  model  informally  described  in  the 
introduction.  We  start  with  a  most  general  set  of  assumptions  and  later  proceed  to  the  development 
of  alternative  specialized  models  to  be  considered  (e.g.  memory  limitations,  particular  forms  of  the 
cost  function  etc.).  As  far  as  the  description  of  the  sequence  of  communications  and  updates  goes, 
we  basically  adopt  the  model  of  Borkar  and  Varaiya  [5]  except  that  time  is  considered  to  be  discrete. 
As  in  [5],  events  are  timed  with  respect  to  a  common,  absolute  clock.  As  far  as  notation  is  concerned, 
we  will  use  subscripts  to  denote  time  and  superscripts  to  denote  agents. 

We  assume  that  we  are  given  a  set  {1  of  N  agents,  an  underlying  probability  space 

(ft,  5 F,  and  a  scalar  valued  cost  function  c;ft  X  V  R,  where  U  is  the  set  of  admissible  values 
of  the  decision  variable.  It  will  be  useful  in  the  sequel  to  distinguish  between  elements  of  U  and  U~ 
valued  random  variables.  The  letter  v  will  be  used  to  denote  elements  of  U  whereas  u,  w  will  be 
used  to  denote  {/-valued  random  variables  (measurable  functions  from  ft  to  U). 

Our  first  assumption  is  of  a  purely  technical  nature  and  is  satisfied  by  almost  all  probability 
spaces  arising  in  applications.  The  following  two  assumptions,  however,  are  of  a  more  essential 
nature. 

Assumption  1:  ft  is  a  Polish  (complete,  separable,  metric)  space  and  9  is  the  associated  Borel  o-field 
(the  <x-ficld  generated  by  the  open  subsets  of  ft). 

Assumption  2:  Either 

(2.1) :  U  is  a  finite  set,  or 

(2.2) :  U  is  a  closed,  convex  subset  of  Rn. 

♦  Assumption  3:  The  cost  function  c  is  nonnegative  and  jointly  measurable  in  (w,  v).  Moreover, 
jE[c(v)]  <  oo,  Vv  £  U.  When  assumption  (2.2)  holds,  we  assume  that  there  exists  a  positive, 
measurable  and  integrablc  function  A:ft  *->  R  such  that 

A(w)||vj  -  vi\\2<\[c{u>,  m)  +  c(w,  t*)]  -  c(w,  Vu;  €  ft,  »*  €  U.  (1) 

The  last  part  of  assumption  3  states  that  c  is  a  strictly  convex  function  of  v  and  strict  convexity 
holds  in  a  uniform  way,  for  any  fixed  u  €  0.  This  assumption  is  satisfied,  in  particular,  if  c  is  twice 
continuously  differentiable  in  v  and  its  Hessian  is  positive  definite,  uniformly  in  v,  for  any  fixed 
w  €  ft.  It  is  also  satisfied  if  c  is  strictly  convex  snd  U  is  a  compact  set. 

We  may  use  the  function  A,  defined  in  assumption  3,  to  define  a  new  measure  ft  on  (ft,  9}  by 

M(5)=  f  A(w)d*(w),  5.  (2) 

JB 
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This  measure  will  be  very  useful  in  section  3. 

We  now  consider  the  generic  situation  facing  agent  i  at  some  time  n.  Let  °JJ(  Cl  6J  be  a  w-ficld 
of  events  describing  the  information  possessed  by  agent  i  at  time  n.  Because  of  assumption  3,  the 
conditional  expectation  exists  (is  finite),  is  “J^-mcasurablc  and  is  uniquely  determined  up 

to  a  set  of  measure  zero  [10|,  for  any  fixed  ugtf.  Agent  t  then  computes  a  tentative  decision  u*n 
that  minimizes  £[c(u)|,'F)l]. 

We  are  now  forced  into  a  mathematical  digression  that  will  ensure  that  u|,  is  well-defined  and 
determines  a  ^-measurable  random  variable. 

We  need  to  pay  special  attention  only  for  the  convex,  continuous  problem  (assumptions  2.2 
and  3).  We  introduce  another  technical  assumption  describing  the  kind  of  knowledge  an  agent  may 
possess  at  some  point  in  time.  Similarly  with  assumption  1,  it  poses  no  real  restriction.  Roughly 
speaking,  it  stales  that  the  knowledge  of  an  agent  can  be  described  by  a  Polish  probability  space. 
Assumption  4:  If  9),  C  9  describes  the  knowledge  of  agent  i  at  time  n,  there  exists  a  Polish  prob¬ 
ability  space  (0*,9*,9*)  and  a  mapping  /: 0  i-»  fl*  such  that  9J,  is  the  smallest  a-field  that  makes 
/  measurable  and  9*(Z?)  =  9(/—  '(O))  ,  V2?  €  9*. 

We  then  have  the  following  Lemma  (proved  in  the  Appendix): 

Lemma  1:  Under  assumptions  I,  2.2,  3,  4,  there  exists  an  almost  everywhere  unique,  ^-measurable 
random  variable  u),  such  that 

£,(c(un)l^,n]^£'lc(u;)l^,rJ>  almost  surely,  (3) 

for  any  fZ-valued,  9 ^-measurable  random  variable  w.  Moreover,  F[c(uj,))  =  E[c(u>)]  for  some  'im¬ 
measurable  w  if  and  only  if  u),  =  w,  almost  everywhere.  The  same  results  are  true,  (except  for 
uniqueness)  under  assumption  2.1. 

We  continue  with  a  description  of  the  process  of  communications  between  agents.  When,  at  time 
n,  agent  i  computes  his  tentative  optimal  decision  he  may  communicate  it  to  any  other  agent. 
(If  is  not  unique,  a  particular  minimizing  uj,  is  selected  according  to  some  commonly  known  rule.) 
Whether,  when  and  to  which  agents  u‘n  is  to  be  sent  is  a  random  event  whose  statistics  are  described 
by  (0,9,9).  We  also  allow  the  transmission  time  to  be  random  but  finite.  We  assume  that  when  an 
agent  receives  a  message  he  knows  the  identity  of  the  agent  who  sent  it. 

Note  that  the  time  at  which  an  agent  receives  a  message  is  a  random  variable  on  (0,9,9). 
As  pointed  out  in  [5],  this  allows  the  possibility  of  signalling  additional  information,  beyond  that 
contained  in  uj,,  by  appropriately  choosing  when  and  to  which  agents  to  communicate. 


We  now  impose  conditions  on  the  number  of  messages  to  be  communicated  in  the  long  run; 
t  hese  conditions  arc  necessary  for  agreement  to  be  guaranteed.  Namely,  we  require  that  there  is  an 
indirect  communication  link  from  any  .agent  to  any  other  agent  which  is  used  an  infinite  number  of 
times.  This  can  be  made  precise  as  follows: 

Let  A(t)  be  the  set  of  all  agents  that  send  an  infinite  number  of  messages  to  agent  i,  with 
probability  i.  Then,  v-e  make  the  following  assumption: 

Assumption  5:  There  is  a  sequence  mi, . . mfc(.  |  =  mi  of  not  necessarily  distinct  agents  such  that 
rrii  £  /t(mj4-i),  *  =  1,2,..  .,k.  Each  agent  appears  at  least  once  in  this  sequence. 

The  main  consequence  of  assumption  5,  which  will  be  repeatedly  used,  is  the  following:  If 
{hL.  I, . .  ,,N}  is  a  set  of  numbers  such  that  h'<hJ,  V/  £  A(t),  Vi,  then  h*  —  hJ,  Vi,j. 

We  continue  with  a  more  detailed  specification  of  the  operation  of  the  agents.  We  introduce 
assumptions  on  the  knowledge  which  arc  directly  related  to  the  properties  of  the  memory  of  agent 
i.  An  agent  may  receive  (at  any  time)  observations  on  the  state  of  the  world  or  receive  tentative 
decisions  (messages)  of  other  agents.  The  knowledge  of  an  agent  at  some  time  will  be  a  subset 
(depending  on  the  properties  of  his  memory)  of  the  total  information  he  has  received  up  to  that 
time.  We  consider  three  alternative  models  of  memory,  formalized  with  the  three  assumptions  that 
follow. 

Let  w'n  be  the  last  message  received  by  agent  i  up  to  (and  including)  time  n.  Our  most  general 
assumption  requires  that  wln  and  uj,_j  are  remembered  at  time  n: 

Assumption  6:  (Imperfect  Memory)  The  <7-fieId  ?{,  is  such  that  uj,_,  and  toj,  are  ^-measurable. 
Remark:  Assumption  6  makes  sense  only  because  of  Lemma  1  which  guarantees  that  uj,_!  and  ti/j, 
are  9- measurable.  We  can  then  define,  inductively,  5),  to  be  the  smallest  sub-<7-field  of  9  with 
respect  to  which  «),_(,  wln  and  any  other  information  rememberd  are  measurable. 

Assumption  7:  (Imperfect  Memory;  Own  Data  Remembered)  Let  G'n  be  the  subfield  of  9  describing 
all  information  that  has  been  observed  by  agent  t  up  to  time  n,  except  for  the  messages  of  other 
agents.  We  assume  that  assumption  6  holds  and  that  G\  C  9‘n. 

Assumption  8:  (Perfect  Memory)  We  let  assumptions  6  and  7  hold  and  assume  that  9},  C  n+i, 
V»,n. 

So,  with  assumption  6  we  only  assume  that  an  agent  remembers  the  content  of  the  last  message  he 
received,  as  well  as  his  last  tentative  decision.  With  assumption  7  we  also  assume  that  he  remembers 
all  his  past  observations  but  is  still  allowed  to  forget  the  contents  of  the  past  messages  he  received. 
In  this  case  the  total  information  available  to  all  agents  is  preserved.  Finally,  assumption  8  implies 
that  the  knowledge  of  an  agent  can  only  increase.  Whenever  assumption  8  holds,  we  will  denote  by 
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9^  the  smallest  o-ficld  in  °S  containing  for  all  n. 

We  conclude  this  section  by  defining  a  few  problems  of  particular  interest: 

(i)  Estimation  Problem:  Given  a  /l"-valucd  random  vector  x  on  (0,9,9),  let  U  =  Rn  and  c(v)  = 
(x  —  v)1  (x  —  v),  where  T  denotes  transpose.  It  is  easy  to  see  that  this  is  a  particular  case  of  a  strictly 
convex  function  covered  by  assumption  3,  with  /l(u>)  being  a  constant. 

(ii)  Static  Linear  Quadratic  Gaussian  Decision  Problem  (LQG):  Let  x  be  an  unknown  random  vec¬ 
tor.  Let  the  sequence  of  transmission  and  reception  times  be  deterministic.  We  assume  that  the 
random  variables  observed  by  the  agents  are,  together  with  x,  jointly  normally  distributed.  We 
allow  the  total  number  of  observations  to  be  infinite,  although  wc  consider  separately  the  finite¬ 
dimensional  problem.  Let  U  —  Rn  and  let  the  cost  function  be  c(u)  =  vTRv  -f  xrQv,  with  R  >  0.  It 
follows  that  the  optimal  tentative  decision  of  agent  t  at  time  n  is  uln  —  GE{x |9JJ=  £'[Gx|'J(J1  where 
G  is  a  prccomputable  matrix.  If  we  redefine  the  unknown  vector  x  to  be  equal  to  Gx  instead  of  x, 
we  conclude  that  we  may  restrict  to  estimation  problems,  without  loss  of  generality. 

(iii)  Finite  Probability  Spaces:  Here  wc  let  O  be  a  finite  set.  Then,  there  exist  finitely  many  o-fields  of 
subsets  of  H.  It  follows  that  tentative  decisions  can  take  finitely  many  values.  We  therefore  assume, 
without  loss  of  generality,  that  U  is  also  a  finite  set. 
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HI  CONVERGENCE  AND  AGREEMENT  RESULTS. 

In  this  section  we  state  and  discuss  our  main  results.  Ail  proofs  can  be  found  in  the  Appendix. 
Assumptions  1,  3,  4  and  5  will  be  assumed  throughout  the  rest  of  the  paper  and  will  not  be  explicitly 
mentioned  in  the  statement  of  each  Theorem.  We  start  with  the  least  restrictive  assumptions  on 
memory: 

Theorem  I:  Under  assumptions  2.2  (convex  costs),  6  (imperfect  memory)  and  deterministic  trans¬ 
missions  and  receptions: 

a)  Iim„— ao(u’n+1  —  uj,)  =  0,  in  probability  and  in  1,2(0, 5,  ju)- 

b)  lim„_oo(«n  ~~  ui)  —  Vi,j,  in  probability  and  in  Li(Cl,9,n). 

Consider  the  following  situation:  At  time  zero,  before  any  observations  are  obtained,  the  se¬ 
quence  of  transmissions  and  receptions  is  selected  in  random,  according  to  a  statistical  law  which 
is  independent  from  all  observations  to  be  obtained  in  the  future  and  from  c(v),  for  any  v  G  U.  In 
other  words,  communications  do  not  carry  any  information  other  than  the  content  of  the  message 
being  communicated  (no  signalling  allowed).  Suppose  that  the  sequence  of  communications  that  has 
been  selected  becomes  known  to  all  agents.  From  that  point  on,  the  situation  is  identical  with  that 
of  deterministic  communications.  In  fact,  a  moment’s  thought  will  show  that  it  is  sufficient  for  the 
history  of  communications  to  become  commonly  known  as  it  occurs:  agent  t  only  needs  to  know,  at 
time  n,  what  communications  have  occured  up  to  that  time,  so  that  he  can  interpret  correctly  the 
meaning  of  the  messages  he  is  receiving. 

We  can  formalize  these  ideas  as  follows:  We  are  given  a  product  probability  space  (0  X  Cl*,  9  x 
5*,  9  y  <y*)  where  (0,5,9)  satisfies  assumptions  1  and  4  (for  any  sequence  of  communications)  and 
where  (Cl*,  9*,  9*)  describes  the  communications  process.  For  each  w*  6  Cl*,  we  obtain  a  distributed 
decision  problem  on  (0,9,9)  with  deterministic  communications.  In  that  case: 

Theorem  2:  Under  assumptions  2.2,  6  and  independent  communications  (as  described  above), 
limn-^ofuj^t  —  ujj  a*  limfWOO(u£l  —  u£)  0,  in  probability  with  respect  to  9  x  9*. 

Strictly  speaking,  Theorems  I  and  2  do  not  guarantee  convergence  of  the  decisions  of  each  agent. 
Suppose,  however,  that  the  agents  operate  under  the  following  rule:  Fix  some  small  7  >  0.  Let  the 
sequence  of  communications  and  updates  of  tentative  decisions  take  place  until  |u},  —  u£|  <  7,  Vi,  3 
(small  disagreement)  and  luj^  —  uj,|  <  7,  Vi  (small  foreseeable  changes  in  tentative  decisions). 
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Then,  we  obtain: 


Corollary  1:  With  the  above  rule  and  the  assumptions  of  Theorems  1  or  2,  the  process  terminates  ' 
in  finite  time,  with  probability  1,  for  any  7  >  0. 

When  f!  and  U  are  finite,  we  have: 

Theorem  3:  If  0  and  U  are  finite  sets,  if  each  agent  communicates  all  the  values  or  v  that  minimize 
£T[c(v)|^FJJ  and  if  assumption  6  holds,  then  there  exists  some  positive  integer  Q  such  that 

UQ  ~  uh'  an<1  UQ+n  ~  UQ>  Vi>  Vn>  6  n- 


Strictly  speaking,  tentative  decisions  in  the  above  theorem  are  not  elements  of  U  but  subsets 
of  U.  This  is  due  to  the  non-uniqueness  of  optimal  tentative  decisions,  in  the  absence  of  convexity. 
The  equalities  appearing  in  Theorem  3  have  to  be  interpreted,  therefore,  as  equalities  of  sets. 

We  now  assume  that  the  agents  have  perfect  memory.  We  obtain  results  similar  to  theorems  1 
and  2  under  much  more  relaxed  assumptions  on  the  communications  process.  Namely,  we  only  need 
to  assume  the  following: 

Assumption  9:  Let  M]/  be  the  k-th  message  sent  by  agent  j  to  agent  i.  We  assume  that  when  agent 
t  receives  Mjf,  he  knows  that  this  is  indeed  the  fc-th  message  sent  to  him  by  agent  j. 

Remark:  This  assumption  is  trivially  satisfied  if  messages  arrive  at  exactly  the  same  order  as  they 
are  sent,  with  probability  1. 

Theorem  4:  Under  assumptions  2.2  (convex  costs),  8  (perfect  memory)  and  9,  there  exists  a  U - 
valued  random  variable  u*  such  that  limn_oeuJl  =  u*,  Vi,  in  probability  and  in  L2 (0,5,/*). 

For  estimation  problems  (uj,  =  Ejil'fjj),  theorem  4  can  be  slightly  strengthened:  [5,  Theorem  1] 

Theorem  5:  For  estimation  problems,  under  the  assumptions  of  theorem  4,  convergence  to  u*  takes 
place  with  probability  1. 

We  now  consider  the  case  where  U  is  finite  but  (unlike  theorem  3)  Q  is  allowed  to  be  infinite. 
Several  complications  may  arise,  all  of  them  due  to  the  fact  that  optimal  decisions,  given  some 


information,  are  not  guaranteed  to  be  unique.  We  discuss  these  issues  briefly,  in  order  to  motivate 
the  next  theorem. 

Suppose  that  U  —  It  is  conceivable  that  £[c(t’i)|6J),]  —  £r(c(v2)|cJ*tl]  is  never  zero  and 

changes  sign  an  infinite  number  of  times,  on  a  set  of  positive  probability.  In  that  ease,  the  deci¬ 
sions  of  agent  t  do  not  convegc.  Even  worse,  it  is  conceivable  that  E(c(t/i)|*f|J  >  E[c(v2)|'J|J  and 
E[c(ui)|cJ{]  <  ,  for  all  n  and  for  all  w  in  a  set  of  positive  probability,  in  which  case  agents  t 

andy  disagree  foever.  It  is  not  hard  to  show  that  in  both  of  the  above  cases  £[o(vi)|<if^c]  —  E|c(t>2)|9£J, 
on  a  set  of  positive  probability  and  this  non-uniqueness  is  the  source  of  the  pathology.  The  following 
theorem  states  that  convergence  and  agreement  arc  still  obtained,  provided  that  we  explicitly  exclude 
the  possibility  of  non-uniqueness. 

Theorem  6:  Under  assumptions  2.1  (finite  U)  and  8  (perfect  memory)  and  if  the  random  variable  u‘ 
that  minimizes  E(c(u>)]  over  all  “if^-measurable  random  variables  is  unique  up  to  a  set  of  measure 
zero,  for  all  i,  then  lim„~oo  t-0,  =  u‘,  almost  surely,  and  u*  =  uJ,  Vi,  j. 

Although  the  preceding  theorems  guarantee  that  (under  certain  conditions)  all  agents  will  agree, 
nothing  has  been  said  concerning  the  particular  decision  to  which  all  agents’  decisions  converge.  In 
particular,  it  is  not  necessarily  true,  as  one  would  be  tempted  to  conjecture,  that  the  limit  decision  is 
the  optimal  centralized  solution  (that  is,  the  solution  to  be  obtained  if  all  agents  were  to  communicate 
all  their  information).  On  the  other  hand,  the  centralized  solution  is  reached  for  LQG  problems, 
under  the  perfect  memory  assumption  [5]  and  is  also  reached  gcnerically  for  an  estimation  problem 
on  a  finite  probability  space  [6].  This  issue  will  be  touched  again  in  the  next  section. 
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IV.  THE  LINEAR  QUADRATIC  GAUSSIAN  (LOG)  MODEL. 


In  this  section  we  specialize  and  strengthen  some  of  onr  results  by  restricting  to  the  Linear  Quadratic 
Gaussian  model  described  in  section  5.  (Recall  that  any  such  problem  is  equivalent  to  an  estimation 
problem;  therefore,  u),  =  i"  =  for  some  random  vector  x.)  Theorems  I,  3  and  4  are 

applicable.  Moreover,  the  results  of  (5j  guarantee  that,  under  assumption  8  (perfect  memory),  it), 
converges  to  the  optimal  centralized  estimate,  given  the  information  possessed  by  all  agents.  The 
following  theorem  states  that  the  same  is  true  under  the  weaker  assumption  7. 

Theorem  7:  For  the  LQG  problem  and  under  assumption  7  (imperfect  memory;  own  data  remem¬ 
bered)  limtl— x'n  =  x,  in  the  mean  square,  where  x  =  E^l^cx,]  and  “Jo  is  the  smallest  o-field 
containing  for  all  t,  rt. 

Note  that  theorem  7  is  much  stronger  than  theorem  1  which  was  proved  for  the  general  case  of 
imperfect  memory.  We  have  here  convergence  to  a  limit  solution  which  is  also  guaranteed  to  be  the 
optimal  centralized  solution. 

Our  final  result  concerns  the  finite  dimensional  LQG  problem  in  which  the  total  number  of 
observations  is  finite.  Namely,  the  smallest  cr-field  containing  Sf),  for  all  t,  n  is  generated  by  a  finite 
number  of  (jointly  Gaussian)  random  variables.  In  that  case,  the  centralized  solution  is  going  to  be 
reached  by  all  agents  in  a  finite  number  of  stages,  provided  that  all  agents  have  perfect  memory. 

Theorem  8:  For  the  LQG  problem  with  finitely  many  observations  and  under  assumption  8  (perfect 
memory),  the  centralized  solution  is  reached  by  all  agents  in  a  finite  number  of  stages. 

Theorems  7  and  8  imply  that  the  scheme  considered  in  this  paper  may  be  viewed  as  a  distributed 
way  of  solving  static  linear  estimation  problems. 


V.  CON  CL 


A  set  of  agents  with  the  same  objective  who  start  communicating  to  each  other  their  tentative 
optimal  decisions  arc  guaranteed  to  agree  in  the  limit.  This  is  true  even  if  their  memory  is  limited 
and  they  are  allowed  to  forget  some  of  their  past  knowledge.  Moreover,  they  arc  guaranteed  to 
converge  to  the  optimal  centralized  decision  for  linear  estimation  problems,  provided  that  they  do 
not  forget  their  own  observations.  These  results  arc  valid  when  ail  agents  share  the  same  model  of 
the  world.  The  characterization  of  the  behavior  of  agents  with  dilTcrcnt  models  (perceptions)  is  an 
open  problem. 
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Proof  of  Lemma  1:  By  theorem  1.1.6  and  corollary  1.1.7  of  [9],  there  exists  a  condtional  probability 
distribution  {<?„,:  u>  £  fl)  of  7  given  7l„  and,  for  any  t;  £  17,  a  version  of  E[c(v)|7|J  such  that 

EHvWJ  =  j^dQ^.  (4) 

Equation  (4)  and  assumption  3  imply  that  £[c(v)|7jj  is  a  strictly  convex  function  of  v,  diverging  to 
infinity  as  ||t>||-+oo,  Vu>  £  fl. 

By  assumption  4,  there  exists  a  Polish  probability  space  (fl*,7*,7*)  and  a  mapping  /:fl  t-+  fl* 
such  that  7),  is  the  smallest  o>field  in  7  with  respect  to  which  f  is  measurable.  Moreover,  7*  is  the 
measure  induced  by  9;  namely,  9*(B)  =  9(f~x(B)),  VJ3  £  7*.  We  then  use  theorem  T18  (p.  10)  of 
[8]  and  the  fact  that  (fl*,7*)  is  isomorphic  to  (S,  7),  where  S  Cl  R  and  9  is  the  Borel  <7- field  on  5 
[4,  p.122],  to  conclude  that  there  exists  a  measurable  function  c*:l7  X  fl*  i -*  R  such  that 

c*(o,/(w))  =  E[c{v)|7,„](w)  (5) 

Moreover,  c*  is  strictly  convex  in  v  and  its  minimum  is  attained  for  any  fixed  w*  £  fl*.  By  the 
results  of  [4]  on  measurable  selections  (see  in  particular  the  discussion  in  page  259)  there  exists  a 
universally  measurable  function  u*:fl*  >-*  U  such  that 

c*(u*(w*),w*)^c*(v,w*),  Vv  £  u,  Vw*  £  fl*.  (6) 

We  now  modify  ii*(io*)  on  a  set  of  7*-measure  zero,  to  obtain  a  Borel  measurable  function  u*:fl*  *-*  U 
such  that 

c*(u*(ta>*),  u>*)<e*(v,  u>*),  Vv  £  U,  7*-almost  surely  (7) 

We  finally  define  tl‘n(w)  *  &*(/(«))  and  uj,(w)  u*(/(w)).  (Note  that  tij,  is  7^-measurable, 
by  construction,  and  that  uj,(w)  minimizes  E[c(u)|7)J(u;).)  Let  B  =  (w:  and  D  = 

{u»*:u*H^fk*(w)>.  Clearly,  B  =  f~~x(D)  and  9*(D)  -  0.  So,  0  »  9*(D)  »  9(/~l(D))  —  9(B). 
Therefore,  ttj,  is  an  7),-measurable  function  that  minimizes  £[c(v)|7jj,  almost  everywhere,  and  this 
implies  the  first  part  of  the  Lemma. 

For  any  7J,-measurable,  17-valued  random  variable  w,  E{c(ujJ|7jJ<E(c(te)17))],  almost  surely, 
and  by  integrating,  EMujJ]  <E[c(  ui)]. 


Now  suppose  that  ^(cfujj]  =  E(c(to)j,  for  some  {/-valued,  ‘/^-measurable  random  variable  w. 
Using  the  minimizing  property  of  u'nl  wc  have 

(8) 

The  converse  inequality  also  holds,  by  the  convexity  of  c.  Therefore,  (8)  is  actually  an  equality. 
Recalling  the  fact  that  c  is  strictly  convex,  for  all  u>  £  Q,  we  conclude  that  u'n  =  w,  almost  surely. 

The  validity  of  the  same  results  (except  for  uniqueness)  under  assumption  2.1  is  immediate  from 
the  fact  that  the  minimum  of  the  finitely  many  random  variables  {£’[c(v)|‘J*J,  v  €  U}  is  itself  a 
random  variable  and  a  measurable  selection  of  uj,  is  straightforward.  | 


+  w 


i(£[cK)]  +  E[c( 


Lemma  2:  Let  {un},  {to,,}  be  two  sequences  of  C/-vaIucd  random  variables  such  that 

lim  E\c(  *  =  lim  £[c(u„)]  =  lim  E(c(u>„)]  (9) 

n— *oo  I  \  l  )\  i»— oo  n—*oo 


If  assumptions  2.2  and  3  hold,  then  limn_SM(tin  —  wn)  =  0  in  Li(n,9,ii)  and  in  probability. 
Proof:  By  assumptions  2.2,  3  and  equation  (9), 


lim  £[i4(u)||u„ 

n— *oo 


lim  E 

n— k» 


c(tin)  -f  c(Wn) 

J'Un  +  Wn\ 

2 

\  2  )\ 

0 


(10) 


which  shows  that  (u„  —  w„)  converges  to  zero  in  Lj(0,  9,  #*)•  Therefore,  it  also  converges  in  measure 
*  with  respect  to  ft. 

Recall  that  >t(w)  >  0  and  ft(B)  =  fBA(u)  d9(u/),  VB  €  9.  Therefore,  fi(B)  —  0  implies  9(B)  ■»  0 
and  9  is  absolutely  continuous  with  respect  to  fi.  Let  B*n  *  {|un  —  to„|  >  e}.  Since  (a*  —  w„)  con¬ 
verges  to  zero  in  measure  fi,  for  any  <  >  0,  we  have  li =*>  0  and,  by  absolute  continuity, 
limn^oo  9(Bln  ®  0,  which  shows  that  we  have  convergence  in  probability.  | 

Proof  of  Theorem  1:  Since  uj,  is  9^- measurable,  we  have  (by  the  minimising  property  of  uj,+1) 
£(c(un  |  l)]<^(c(un)].  Since  c  is  nonnegative,  £’(c(u^)]  converges  to  some  constant  gi.  We  also  note 
(“n+l  +  U’n)/2  U  •measurable  and  by  taking  the  limit  in  the  relation 

+ 401 2  )]  -  (“) 
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we  obtain  iimn_00E[c((u|1+j  -|-  ttJJ/2)}  =  gl.  Lemma  2  then  yields  the  first  part  of  the  theorem. 

Let  j  £  /4(t).  Then  there-exist  sequences  {mfc}  and  {nt}  of  positive  integers  such  that 
limfc—oo  mk  —  Umfe_oo  nk  =  oo  and  m*,  nk  are  the  times  of  transmission  and  reception,  respectively, 
of  the  k-ih  message  from  agent  j  to  agent  t.  Therefore,  u3mk  is  6J‘„k-mea8urable,  for  all  k,  and 
£[c(ujIJ]<£[c(u^)]  which  shows  that  ^<5'.  Using  assumption  5,  we  conclude  that  gl  —  g3,  Vt,j. 

We  note  that  (u„k  -f-  «f,lk)/2  is  5FJ(|i-measurablc  and,  therefore,  "" 


~E[c(u\)  +  c(u3)} 


*  4 -u3 

~  mu 

2 


E[«)] 


Taking  the  limit  in  (12)  and  using  Lemma  2,  we  obtain  the  second  half  of  the  theorem,  ff 

Proof  of  Theorem  2:  Theorem  1  and  the  discussion  preceding  the  statement  of  theorem  2  show  that 
limn_0o(u),+l  —  uj,)  =  0,  in  probability  with  respect  to  9,  for  all  to*  £  fi*.  Let  Xn(w,w*)  be  the 
characteristic  function  of  the  set  {(u>,w*):  —  u}J|  <  e}.  Then, 

lim  fxnd{9x9*)  —  lim  [  [  Xnd9d9*  —  [  lim  f  Xnd9d9*  =  1  (13) 

n— *oo  J  n— 00  J  J  J  n—*o o  J 

(The  first  equality  holds  by  the  Fubini  theorem;  the  second  by  the  dominated  convergence  theorem; 
the  third  by  convergence  of  (uj,+1  —  uj,)  to  zero,  with  respect  to  the  probability  measure  9.)  This 
shows  that  uj,+l  —  uj,  converges  to  zero  in  probability  with  respect  to  9  X  9*.  | 

Proof  of  Corollary  1:  By  theorem  1,  the  U  X  {/-valued  sequence  of  random  variables  (u  j,— -u£,  uj,+1— 
ujj  converges  to  (0, 0)  in  probability.  It  therefore  contains  a  subsequence  converging  to  (0, 0),  al¬ 
most  surely  [10].  Therefore,  V7  >  0  and  for  almost  all  w  £  fl,  Brio  such  that  |uj^  —  u£j  <  7, 
|uj^+,  —  u|J  <  7  and  the  termination  condition  is  eventually  satisfied  with  probability  one.  | 

Proof  of  Theorem  3:  Because  of  the  finiteness  of  fl,  there  exists  a  finite  (non-random)  time  after 
which  communications  (conditioned  on  past  events)  are  deterministic.  We  may  take  that  time  as 
the  initial  time  and  assume,  without  loss  of  generality,  that  all  communications  are  deterministic. 

Let  uj,  be  the  set  of  elements  of  U  which  are  optimal,  given  9'n.  Let  wj,  denote  5^-measurabIe 
random  variable  such  that  u/j,(w)  £  uj,(u;),  Vw  £  O.  (Note  that  E[c(u>jJ]  is  independent  of  how 
wj,  has  been  selected.)  By  finiteness  of  Cl  and  U,  there  exist  finitely  many  {/-valued  random  vari¬ 
ables  and,  since  £'[c(to^+l)j<£'[c(u'^)]l  we  conclude  that  there  exists  some  positive  integer  T  and 


some  g 1  such  that  E[c(io'J]  =  g\  Vn  >  T.  For  any  n  >  T,  EfcftvJ,)]  ==  £[c(iv|,+1)]  and  since 
tv‘n4s  ^FJl+1-ineasurable,  tv|,  minimizes  £(c(tv)]  over  all  (-measurable  random  variables.  Hence, 
tv|,(w)  G  (o>),  Viv  G  H  which  shows  that  u'n(u)  C  Vw  G  H.  Again,  by  finitcncss  of  U 

and  a  there  exists  some  positive  integer  Q  such  that  ujl+1(w)  =  uj,(w),  Vr..  >  Q,  Vw  G  fl,  Vi. 

If  j  G  /!(*),  there  exist  m,n  >  Q  such  that  tuJm  is  ^-measurable  and  this  shows  that  gi<g*. 
By  assumption  5,  we  obtain  g'  —  g\  Vi,/  Therefore,  wJm  minimizes  £[e(tv)|  over  all  ^Immeasurable 
random  variables  and,  therefore,  tvf„(u/)  G  u),(cv),  ori  um(w)  C  u|,(w),  Vu/  G  Q.  Recalling  assumption 
5,  we  obtain  u{,(u>)  =  u‘n((v),  Vi,  j,  Vm,  n  >  Q,  Vcv.  | 

Lemma  3:  Let  T  be  a  finite  stopping  time  of  an  increasing  family  {*F„}  of  o-fields.  Let  un,  n  = 
1, 2, ...  be  random  variables  that  minimize  £[c(w)(7n],  almost  surely,  over  all  ^-measurable  random 
variables  tv.  Then,  ur  minimzes  Jr7[c(to)]  over  ail  ?r-measurablc  random  variables  tv,  where  ut  —  tin 
if  and  only  if  T  —  n. 

Proof:  Let  x»  be  the  indicator  function  of  the  set  (w:  T(w)  =  n}.  Since  T  is  a  stopping  time,  x™  is 
°Jn-measurable.  Note  that  Xnc(un)  =  Xnc(ur)-  Let  tv  be  a  ly-measurable  random  variable  and  note 
that  Xn<^w)  =  Xnc{knw)  and  x„to  is  ^n-measurable.  Therefore, 

£|x«cM|S„]  =  XnE[c(xnv) l*n]  >  X,^(c(«„)|5„]  -  E|x»<Kun)(?n]  =  £[x»c(ur)|*„J  (14) 

Taking  expectations,  we  obtain 

£(Xnc(tv}j  >  £(XnC(«r))  •  (15) 

and  summing  over  all  n’s  (and  using  the  monotone  convergence  theorem  to  interchange  summation 
and  expectation)  we  obtain  £(c(tv)]  >  £[c(ur)j.  I 

Proof  of  Theorem  4:  Since  u*n  is  *F'n+l-measurablc,  we  have  ^(c(u{,+l)]<E(c(u^)].  Since  c  is  non¬ 
negative,  E(c(u'n)]  converges  to  some  constant  p*.  We  also  note  that  (uj,  -f  uJ,+m)/2  is  im¬ 
measurable.  Therefore,  £(c((tii,_)_m  +  u}|)/2)]  >  E[c(u},+m)]  >  g*.  Fut  some  e  >  0,  and  let  n  be  large 
enough  so  that  E{c(u^)]<p*  -f  e.  Then,  using  assumption  3,  we  obtain  E[/l(a;)||ujl+.m  —  u{l||2]<e, 
Vm  >  0.  Therefore,  (u^)  is  a  Cauchy  sequence  in  Lj( XI,  9,  p).  By  the  closedness  of  U  and  com¬ 
pleteness  of  Li  spaces,  there  exists  a  (/-valued  random  variable  u‘  such  that  limn_^eU^  *  u',  in 
La(fl,?F,#*)  and,  therefore,  in  probability,  with  respect  to  9.  (The  proof  of  the  last  implication  is 
contained  in  the  proof  of  Lemma  2.) 


Fix  some  v  £  U.  Then,  {E[c(v)|9}J,  n  =  1,2,...}  is  a  uniformly  integrate  martingale  [8, 
Theorem  T19,  p.86|.  Since  E(c(u|l)|<J,„]  <E(c(t>)|'J51],  a.s.,  and 

^l^k«U,)in+I]|9iJ<£:^lc(<)|^+1]|?rJ  =  E[c(ui,)|5’j,  (16) 

jj,  n-=s  1,  2, , . . .}  is  a  uniformly  integrable  supermartingale,  with  respect  to 
Let  j  €  A(i).  Let  m/t,  nk  be  the  times  of  transmission  and  reception,  respectively,  of  the  fc-th 
message  from  j  to  i.  Because  of  assumption  9,  both  mk  and  nk  are  stopping  times  of  {?£}, 
respectively,  and  since  j  €  A(i),  they  are  almost  surely  finite  stopping  times,  for  all  k.  Moreover, 
k<mk<nk  and  by  the  optional  sampling  theorem  [8,  Theorem  T28,  p.90] 

E[C(u‘)]>E[C(u‘J]>p*- 


which  shows  that  lim/b_oo E(c(uJ^)]  =  g‘.  Similarly,  Iimfc_0O£[c(u£lfc)]  =  g*. 

Note  that  u]mk  is  “J^-measurable  and,  by  Lemma  3,  E(c(uf„J]  >  £7(c(uJ,fc)J.  Taking  the  limit, 
we  obtain  g j  >  g',  and  by  assumption  5,  g'  =  g'  =  g,  Vt ,j. 

We  now  take  the  limit  of  the  inequalities 

jemo + 4<)i  >  (i7) 

to  obtain  limfc^EJcftu^  +  ujJ/^)]  =  pand,  by  Lemma  2,  limfc_00(u^  —  u£j  =  0,  in  Li[Q,^,n) 
and  in  probability. 

We  also  take  the  limit  of  the  inequalities 


2£lc(<*)  +  c(' 


viH  sewOI 


to  obtain  lim^-^oE^uj.  +  u*J/2)]  =  g*  and,  by  Lemma  2,  lim/t—ooCu^  —  ti£)  =*  0.  Similarly,  we 
obtain  limfc_ao(tt^l  —  uj)  =  0,  which  shows  that  u*  —  u3,  almost  surely.  | 


,  Fix  some  v  6  U  and  let  B  =  {«:  u‘(w)  =«*  «}.  Then,  E[c(t;)!^jJ  < 


Vo*  6  U,  for  almost  all  u  €  B.  By  the  martingale  convergence  theorem  [8,  Theorem  T17,  p.84],  we 


conclude  that,  for  almost  all  u  €  B,  there  exists  some  lV(w)  such  that 


EW»)|5y<EI4»,)l»|J 


Therefore,  u'n(u)  =  v,  for  almost  all  a;  €  5  and,  by  considering  the  other  elements  of  U  as  well, 
Iimn_c»u5,  =  u‘,  almost  surely. 

If  j  £  A(i),  uJ  is  5^-measurable  and  £[c(uJ)]  >  £[c(u')].  By  assumption  5,  E[c(u')]  —  E^tt7)], 
Vi,  j.  Therefore,  for  j  £  A(t),  uJ  minimizes  E(c(to)]  over  all  ^-measurable  random  variables  and  by 
the  assumptions  of  the  theorem,  u‘  =  tzJ,  almost  surely.  Using  assumption  5  once  more,  we  obtain 
u‘  =  uJ,  Vi,j.  1 

Proof  of  Theorem  7:  As  is'usual  in  linear  least  squares  estimation,  we  use  the  setting  of  Hilbert  spaces 
of  square  integrable  random  variables.  Let*  G  be  a  Hilbert  space  of  zero  mean,  jointly  Gaussian  ran¬ 
dom  variables  on  (Cl,  9, 9)  such  that  each  component  of  the  unknown  vector  *  and  the  observations 
belong  to  G.  The  inner  product  in  G  is  defined  by  <  *,  y  >=  E[xy\. 

For  each  agent  i,  let  H *  denote  the  smallest  closed  subspace  of  G  containing  all  observations 
obtained  by  him.  Let  H'n  be  the  smallest  closed  subspace  of  G  containing  all  observations  obtained 
by  agent  i  up  to  time  n.  (Note  that  H'n  does  not  contain  ail  random  variables  known  by  agent 
i  at  time  n,  because  it  does  not  need  to  contain  any  of  the  messages  received  by  agent  t.  Note 
also  that,  by  assumption  7,  Hln  C  C  H%  and  that  's  the  total  knowledge  available 

to  all  agents,  where  £  denotes  the  direct  sum.  The  centralized  estimate  is  the  projection  of  *  on 

{Hk.)  We  assume,  without  loss  of  generality,  that  *  is  a  scalar  random  variable,  since  each 
component  can  be  separately  estimated. 

Let  i'n  =  E\x |9{J  and  e'n  =  x  —  x'n  and,  by  the  orthogonality  of  errors  and  observations,  we 
have  E[xy]  =  E[x'ny],  Vy  £  H'n.  Similarly  with  previous  proofs,  we  have  II<-hII2<I.I<II2,  Vn,» 
which  implies  that 

imi!  >  iiiu.ii’  >  iinn1  (u) 

In  particular,  (19)  implies  that  {zJJ  is  a  norm-bounded  sequence.  By  the  weak  local  sequential 
compactness  of  Hilbert  spaces  [11,  p.126],  (i),}  contains  a  weakly  convergent  subsequence  {2^}. 
In  other  words,  there  exists  an  element  £3 0  £  (7  such  that  <  y,  >  converges  to  <  y,  2^  >, 
Vy  €  G.  Moreover,  x'n  £  C  Hk  and  since  d°Md  subspaces  are  also  weakly  closed 

[11,  Theorem  11,  p.125],  6  Udi#*  Now  let  y  £  ff‘n.  Then,  <  y.i^  >*<  y,*  >,  for  all 

l  such  that  ni  >  n,  which  implies  that  <  y,  2*,  >*<  y,x  >.  Moreoever,  the  sequence  of  spaces 
{#*,}  generates  //*  which  implies  that  <  y,  £»  >=*<  y,  *  >,  Vy  £  H*. 

By  theorem  1,  (*‘n  —  x£)  converges  in  mean  square  (and  therefore  weakly)  to  zero,  which  implies 
that  2ao  is  also  a  weak  limit  point  of  {2*,}.  The  same  argument  as  before  shows  that  <  y,  >» 
<  y,  *  >,  Vy  £  Vy.  Therefore,  <  y,  2*,  >“  <  y,x  >,  Vy  £  Hk-  But  this  is  exactly  the 
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condition  that  x^  is  the  centralized  estimate,  given  the  observations  of  all  agents.  So,  {xj,}  has  a 
unique  weak  limit  point,  which  is  the  same  for  all  i  and  coincides  with  the  centralized  estimate. 

It  only  remains  to  show  that  converges  to  strongly  (in  the  mean  square).  We  know  from 
[11,  p.120]  that  ||xeoj|<liminf„^00||5:[lj|.  On  the  other  hand, 

IMP  -  IIM*  =  II*  -  M!<ll*  -  i‘nll!  -  IMP  -  iMjP  (20) 

which  shows  that  Ijxoojl  >  limsup^.,^  ||iJ,|J.  Therefore,  ||ioo||  =  limn_co||i|l||  and  by  theorem  8, 
p.124  of  [J  l],  we  conclude  that  lim„_oo  ||£),  —  ioo||2  ==  0.  | 

Proof  of  Theorem  8:  We  use  again  the  Hilbert  space  formalism  of  the  previous  proof.  Let  G\  be  the 
subspace  of  G  describing  the  knowledge  of  agent  i  at  time  n  (both  his  observations  and  the  messages 
he  has  received).  By  assumption  8,  we  have  £?(,  C  C  G.  Since  G  can  be  chosen  to  be  finite 
dimensional,  there  exists  some  M  (depending  on  the  sequence  of  communications  but  deterministic) 
such  that  <7\f+n  =  H'm,  Vn  >  0,  Vi.  Equivalently,  £\f+n  =  x'M,  Vn  >  0,  Vi,  and  by  theorem  7, 
~  *oo*  ^l*>  3'  i 
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